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Analyzing coherent gene expression patterns is an important task in bioinformatics 
research and biomedical applications. Recently, various clustering methods have been 
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expression patterns as the centroids of the clusters. However, the interpretation of co- 
expressed genes and coherent patterns mainly depends on the domain knowledge, which 
presents several challenges for coherent pattern mining and cannot be solv ... 

Convergent algorithms for collaborative filtering 
Jon Kleinberg, Mark Sandler 

June 2003 Proceedings off the 4th ACM conference on Electronic commerce EC '03 
Publisher: ACM Press 

Full text available* pdf(282 34 KB) Additional Information: full citation , abstract , references , citings , index 

~ terms 

A collaborative filtering system analyzes data on the past behavior of its users so as to 
make recommendations — a canonical example is the recommending of books based on 
prior purchases. The full potential of collaborative filtering implicitly rests on the premise 
that, as an increasing amount of data is collected, it should be possible to make 
increasingly high-quality recommendations. Despite the prevalence of this notion at an 
informal level, the theoretical study of such convergent algori ... 



B 



B 



Keywords: clustering, collaborative filtering, latent class models, mixture model 



3 Content-based filtering & collaborative filtering: A nonparametric hierarchical 
bayesian framework for information filtering 
Kai Yu, VolkerTresp, Shipeng Yu 

July 2004 Proceedings of the 27th annual international ACM SXGIR conference on 

Research and development in information retrieval SXGXR '04 
Publisher: ACM Press 




http://portal.acm.org/results.cfm?coll=ACM&dl=ACM&CFID=13928189&CFTO 2/8/2007 



Results (page 1): vigilance parameter "cluster structure" "user preference" 



Page 2 of 7 



Full text available: ^ pdf(425.46 KB) Additional Information: full citation , abstract , references , citings , index 

terms 

Information filtering has made considerable progress in recent years. The predominant 
approaches are content-based methods and collaborative methods. Researchers have 
largely concentrated on either of the two approaches since a principled unifying 
framework is still lacking. This paper suggests that both approaches can be combined 
under a hierarchical Bayesian framework. Individual content-based user profiles are 
. generated and collaboration between various user models is achieved via a co ... 
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Clustering is the unsupervised classification of patterns (observations, data items, or 
feature vectors) into groups (clusters). The clustering problem has been addressed in 
many contexts and by researchers in many disciplines; this reflects its broad appeal and 
usefulness as one of the steps in exploratory data analysis. However, clustering is a 
difficult problem combinatorially, and differences in assumptions and contexts in different 
communities has made the transfer of useful generic co ... 
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Contextual preferences take the form that item il is preferred to item /' 2 in the context of 

X. For example, a preference might state the choice for Nicole Kidman over Penelope Cruz 
in drama movies, whereas another preference might choose Penelope Cruz over Nicole 
Kidman in the context of Spanish dramas. Various sources provide preferences 
independently and thus preferences may contain cycles and contradictions. We reconcile 
democratically the preferences accumulated from ... 
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It is crucial in many information systems to organize short text segments, such as 
keywords in documents and queries from users, into a well-formed taxonomy. In this 
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article, we address the problem of taxonomy generation for diverse text segments with a 
general and practical approach that uses the Web as an additional knowledge source. 
Unlike long documents, short text segments typically do not contain enough information 
to extract reliable features. This work investigates the possibilities of u ... 
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Publisher: ACM Press 

Full text available: ^pdf(166.69 KB) Additional Information: full citation , abstract , index terms 

This is a summary of the Advances in Model-Based Software Testing (A-MOST) workshop 
held on May 15-16, 2005, in St. Louis, Missouri. The workshop had approximately 40 
participants. The goals of this workshop were to offer a comprehensive overview of 
model-based testing to the ICSE community, and bring the researchers and practitioners 
together to discuss advances, applications, and the complex problems yet to be solved in 
model-based testing. The workshop was lively and maintained participant in ... 
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Electronic concept mapping tools empower experts to play an active role in the knowledge 
capture process, and provide a medium for building richly connected multimedia 
knowledge models— sets of linked concept maps and resources about a particular 
domain. Knowledge models are intended to be used as a means for sharing knowledge 
among humans, not as carefully-crafted knowledge bases upon which machines will be 
performing inference. However, users must still confront the questions of what t ... 
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UML diagrams have become increasingly important in the engineering and reengineering 
processes for software systems. Of particular interest are UML class diagrams whose 
purpose is to display class hierarchies (generalizations), associations, aggregations, and 
compositions in one picture. The combination of hierarchical and non-hierarchical relations 
poses a special challenge to a graph layout tool. Existing layout tools treat hierarchical 
and non-hierarchical relations either alike or as separat ... 

Keywords: UML Class Diagrams, graph drawing, graph layout, software visualization 
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^ June 1999 ACM SIGMOB Record , Proceedings of the 1999 ACM! SIGMOD international 
conference on Management of data SIGMOD '99, volume 28 issue 2 

Publisher: ACM Press 

Full text available - E nSj pdf(1,77 MB) Additional Information: full citation , abstract , references , citings , index 
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Cluster analysis is a primary method for database mining. It is either used as a stand- 
alone tool to get insight into the distribution of a data set, e.g. to focus further analysis 
and data processing, or as a preprocessing step for other algorithms operating on the 
detected clusters. Almost all of the well-known clustering algorithms require input 
parameters which are hard to determine but have a significant influence on the clustering 
result. Furthermore, for many real-data sets there doe ... 

Keywords: cluster analysis, database mining, visualization 
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In this paper, we investigate epidemiological models to reason about computer viral 
propagation. We extend the classical homogeneous models to incorporate two timing 
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parameters: Infection delay and user vigilance. We show that these timing parameters 
greatly influence the propagation of viral epidemics, and that the explicit treatment of 
these parameters gives rise to a more realistic and accurate propagation model. We 
validate the new model with simulation analysis. 

Keywords: anti-virus, computer virus, epidemiology, security 
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Publisher: ACM Press 

Full text available: ^ pdf(367.51 KB) Additional Information: full citation , abstract , references , index terms 

It is widely recognized that developing efficient and fully automated algorithms for 
clustering large transactional datasets is a challenging problem. In this paper, we propose 
a fast, memory-efficient, and scalable clustering algorithm for analyzing transactional 
data. Our approach has three unique features. First, we use the concept of Weighted 
Coverage Density as a categorical similarity measure for efficient clustering of 
transactional datasets. The concept of weighted coverage density is in ... 

(Keywords: AMI, LISR, SCALE, weighted coverage density 
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April 2006 ACM Transactions on Information Systems (TOIS), Volume 24 issue 2 
Publisher: ACM Press 

Full text available: *Q pdf(4.48 MB) Additional Information: full citation , abstract , references , index terms 

With continued advances in communication network technology and sensing technology, 
there is astounding growth in the amount of data produced and made available through 
cyberspace. Efficient and high-quality clustering of large datasets continues to be one of 
the most important problems in large-scale data analysis. A commonly used methodology 
for cluster analysis on large datasets is the three-phase framework of 
sampling/summarization, iterative cluster analysis, and disk-labeling. There are th ... 

Keywords: Clustering, interactive visualization, labeling, large datasets, performance 



□ 




6 Learning user preferences for sets of objects 
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* June 2006 Proceedings of the 23rd international conference on Machine learning 
XCML '06 
Publisher: ACM Press 
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Most work on preference learning has focused on pairwise preferences or rankings over 
individual items. In this paper, we present a method for learning preferences oversets of 
items. Our learning method takes as input a collection of positive examples— that is, one 
or more sets that have been identified by a user as desirable. Kernel density estimation is 
used to estimate the value function for individual items, and the desired set diversity is 
estimated from the average set diversity o ... 
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modeled as a complex adaptive system with listener event graph objects (LEGOS) 
Joerg C. G. Wellbrink, Arnold H. Buss 

December 2004 Proceedings of the 36th conference on Winter simulation WSC '04 
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Full text available: *^ pdf(329.97 KB) Additional Information: full citation , abstract , references 

There has been an increasing need to incorporate human performance in simulation 
models. Situations in which human performance is subject to degradation overtime, such 
as vigilance tasks, are not represented. This article describes a computational model for 
vigilance performance embedded in a new cognitive framework that utilizes recent 
advances in system neuroscience, evolutionary psychology and complexity theory. The 
Reduced Human Performance Model (RHPM) captures human errors in monitoring t ... 

18 Trust, recommendations, evidence, and other collaborative know-how (TRECK): A [ | 
fuzzy model for reasoning about reputation in web services 
Wanita Sherchan, Seng W. Loke, Shonali Krishnaswamy 

April 2006 Proceedings of the 2006 ACM symposium on Applied computing SAC '06 
Publisher: ACM Press 

Full text available: *^ pdf(1 85.79 KB) Additional Information: full citation , abstract , references , index terms 

Reputation systems are typically based on ratings given by the users. When there are no 
mechanisms in place to detect collusion and deception, combining user testimonies as 
such to form a provider's reputation may not give an accurate assessment, especially if 
the context of the ratings is not known. Moreover, such. systems are vulnerable to 
manipulations by malicious users. Hence it becomes essential to establish the validity of 
the ratings prior to using them in formulating reputation based on ... 

[Keywords: rationale inference, reputation, web services 
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Gonzalo Ramos, Ravin Balakrishnan 

October 2005 Proceedings of the 18th annual ACM symposium on User interface 

software and technology UIST '05 
Publisher: ACM Press 



Full text available - ^ pdf(834 26 KB) Add ' t ' onal Information: full citation , abstract , references , citings , index 
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High precision parameter manipulation tasks typically require adjustment of the scale of 
manipulation in addition to the parameter itself. This paper introduces the notion of Zoom 
Sliding, or Zliding, for fluid integrated manipulation of scale (zooming) via pressure input 
while parameter manipulation within that scale is achieved via x-y cursor movement 
(sliding). We also present the Zlider (Figure 1), a widget that instantiates the Zliding 
concept. We experimentally evaluate three different inp ... 

Keywords: input, multi-scale navigation, pen-based interfaces, pressure widgets 
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